Orthogonal least squares based fast feature selection for linear classification
نویسندگان
چکیده
An Orthogonal Least Squares (OLS) based feature selection method is proposed for both binomial and multinomial classification. The novel Squared Correlation Coefficient (SOCC) defined on Error Reduction Ratio (ERR) in OLS used as the ranking criterion. equivalence between canonical correlation coefficient, Fisher’s criterion, sum of SOCCs revealed, which unveils statistical implication ERR first time. It also shown that has speed advantages when applied greedy search. comprehensively compared with mutual information methods embedded using synthetic real world datasets. results show always top 5 among 12 candidate methods. Besides, can be directly to continuous features without discretisation, another significant advantage over
منابع مشابه
Fast orthogonal least squares algorithm for efficient subset model selection
An efficient implementation of the orthogonal least squares algorithm for subset model selection is derived in this correspondence. Computational complexity of the algorithm is examined and the result shows that this new fast orthogonal least squares algorithm significantly reduces computational requirements.
متن کاملDeterministic Feature Selection for Regularized Least Squares Classification
We introduce a deterministic sampling based feature selection technique for regularized least squares classification. The method is unsupervised and gives worst-case guarantees of the generalization power of the classification function after feature selection with respect to the classification function obtained using all features. We perform experiments on synthetic and real-world datasets, nam...
متن کاملOrthogonal-Least-Squares Forward Selection for
The objective of modelling from data is not that the model simply fits the training data well. Rather, the goodness of a model is characterized by its generalization capability, interpretability and ease for knowledge extraction. All these desired properties depend crucially on the ability to construct appropriate parsimonious models by the modelling process, and a basic principle in practical ...
متن کاملFast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets
Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...
متن کاملOrthogonal vs. uncorrelated least squares discriminant analysis for feature extraction
In this paper, a new discriminant analysis for feature extraction is derived from the perspective of least squares regression. To obtain great discriminative power between classes, all the data points in each class are expected to be regressed to a single vector, and the basic task is to find a transformation matrix such that the squared regression error is minimized. To this end, two least squ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2022
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2021.108419